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Device for marking and restoring multimedia signals 

The present invention relates to a device for marking and restoring multimedia signals. 

Marking of a multimedia signal, a process also known as watermarking, involves 
5 invisibly embedding a message in the multimedia signal before it is transmitted so as to 
be able to restore it in a legible manner on reception. To ensure the secrecy of the 
embedded message, a set of private or public keys is often used to deny unauthorised 
persons the possibility of finding or removing the hidden message. 
There are numerous application domains for a method of marking multimedia signals. 

10 Firstly, in a protection context, it can be useful to insert a hidden message into the 
content of a multimedia signal making it possible to subsequently identify the content, 
to identify the owner of the content, or to determine the rules governing the use of this 
content, such as distribution rights or copyright, for example. 

However, the content of the multimedia message can be degraded in various ways. For 
1 5 example, it can be degraded following the use of a representation format that introduces 
degradation, such as lossy coding (for example JPEG for fixed images, MPEG for 
video, or MP3 for audio), or by various acquisition methods such as analogue recording, 
printing, or scanning in the case of an image. 

The content of a multimedia signal can also be degraded by reformatting, for example 

20 by selecting a portion of an audio file or cropping an image. 

The content of a multimedia signal can also be subject to intentional attacks with the 
aim of defeating the message extraction process. This can be done by adding noise to 
the signal, by using a filtering technique or by using desynchronising techniques (for 
example, geometric transformation in the case of images, or change of frequency in the 

25 case of sound files). In this kind of application, it is important to ensure that the 
embedded message can be extracted correctly regardless of whether or not the content 
has been intentionally modified. 

Another application domain relates to the provision, by means of a watermarking 
process, of a channel for the transmission of information in an imperceptible manner 
30 and linked to the actual content of the multimedia signals. In particular, this can be 
useful in the case of transcoding or subsequent dissemination of the content, where the 
existence and/or the long-term viability of such a transmission channel is not 
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guaranteed. This side channel can then be used, depending on its capacity, to transmit 
any useful information. By way of example, this can include the insertion of meta-data 
describing the watermarked content (such as a content identifier or a description of 
content elements) which can subsequently be used to provide a value added service, or 
ancillary information (such as a teletext service or subtitles). Here again, it is important 
to be able to extract this information after the content has been manipulated in various 
ways, principally transcoding, arid therefore to have a robust watermarking system. 

Known marking devices rely on a COFDM type modulation technique, commonly used 
in digital communications, wherein bits bj define the message and are modulated by 
several carriers defined by public and private keys. The signal thus modulated is added 
to the original signal. On extraction, demodulation is used to restore the inserted bits bj. 
However this marking technique suffers from a number of imperfections in that the host 
signal can interfere with the carriers used, the inserted signal may be visible, or the 
resynchronisation may be imperfect. 

The aim of the invention is to remedy this situation. 

To this end, the invention proposes a signal processing device including a signal 
transformation module capable of producing a transformed signal from an original 
signal and a mixing module intended to mark the transformed signal with a marking 
message. According to a characteristic of the invention, the mixing module includes: 

- a formatting module capable of calculating a response of the transformed signal to the 
demodulation of a first set of carriers defined by keys protecting the message, and of 
calculating a marking information based on this response and code words associated 
with the marking message, 

- a modulator capable of modulating the marking data supplied by the formatting 
module with a given coefficient of the carriers of the first set of carriers, and of 
modulating in amplitude the resulting coefficient by a corresponding quantity related to 
the energy weighting term of the marking message and to the set of carriers, thereby 
supplying a marking coefficient, 

- an adder capable of adding the marking coefficient to the corresponding coefficient of 
the transformed original signal. 

The amplitude modulation performed by the modulator thus enables the added signal to 
be rendered hardly visible. Moreover, the device proposed by the invention implements 
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a channel coding technique with side information. In this technique, the components of 
the marking data are floating values defined in a manner such that their insertion 
compensates the response of the host signal. 

According to another characteristic of the invention, the formatting module includes a 
5 demodulator intended to perform the demodulation, this demodulator being capable of 
■ multiplying each coefficient of the transformed signal by the corresponding coefficient 
of a given carrier in the first set of carriers, by the perceptual weight of distortion and by 
the attenuation factor associated with the transformed signal coefficient, and adding the 
coefficients thus determined, thereby supplying a component of the response of the 
10 transformed signal. 

The formatting module is also capable of calculating the marking information from a 
predetermined parameter, a first vector associated with a particular code word of the 
marking message, and a second vector forming in conjunction with said first vector a 
normalised orthogonal base defining a hyperplane. 

15 In particular, the particular code word is obtained by minimising a quadratic error 
criterion between the code words associated with the marking message and the 
normalised value of the response of the transformed signal to the demodulation. 

Each component of the second vector is proportional to the difference between the 
corresponding component of the demodulation response and the projection of the vector 
20 representing the demodulation response on a unit vector colinear with the first vector. 

The predetermined parameter corresponds to the angle between the vector representing 
the marking information and the first vector, this parameter being determined by 
maximising the relationship: 

K.(uo + cos 9 f - (vo + sin G) 2 

25 in which: 

- uo denotes the scalar product between the vector representing the demodulation 
response and the first vector, divided by the number m of components of the 
demodulation response, 

- vo denotes the scalar product between the vector representing the demodulation 
30 response and the second vector, divided by the number m, 
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- K = 1/(2 2(C+R)m -1), C and R respectively denoting the number of useful bits and 
adaptation bits to the original signal, and m denotes the number of components of the 
demodulation response. 

According to another characteristic of the invention, the mixer includes a scaling 
5 module capable of modulating in amplitude each signal coefficient supplied by the 
adder circuit by a quantity related to the energy weighting term of the marking message 
and the variance of the corresponding coefficient of the transformed signal. 

This quantity is defined by a xi 2 / (a xi 2 + a wi 2 ), where c xi 2 is the term defining the energy 
of the marking message and o xi 2 is the variance of the corresponding coefficient of the 
10 transformed signal. 

This amplitude modulation corresponds to a Wiener filter and serves to limit the noise 
thus added to the host signal. 

According to another characteristic of the invention, the device includes an inverse 
transformation module at the mixer output, capable of performing an inverse 
15 transformation on the marked signal relative to that performed by the transformation 
module, and a signal transformation module capable of transforming the resynchronised 
marked signal, thereby supplying a transformed marked signal. 

The device can also include an extraction device at the output of the inverse 
transformation module to extract the message from the marked signal, this extraction 
20 device incorporating a ^synchronisation module capable of resynchronising the marked 
signal. 

In particular, the extraction device is capable of calculating a response of the 
resynchronised marked signal to the demodulation of a second set of carriers defined by 
message protection keys, which provides an estimation of the embedded marking 
25 information. 

In an alternative embodiment, the first set of carriers and the second set of carriers are 
identical. 

Furthermore, the extraction device can include a demodulator intended to perform the 
demodulation, this demodulator being capable of multiplying each coefficient of the 
30 resynchronised marked signal by the corresponding coefficient of a given carrier in the 
second set of carriers and by the perceptual weight of distortion associated with said 
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coefficient of the resynchronised marked signal, and of adding the coefficients thus 
determined, which supplies one component of the marking information estimate. 
In addition, the extraction device can include a carrier generating module capable of 
generating the second set of carriers from the message protection keys. 
The extraction device can also include a decoder capable of determining the code word 
closest to the marking information estimate by maximising a quadratic error criterion 
between a set of code words and the marking information estimate, which supplies the 
marking message. 

According to another characteristic of the invention, the processing device can also 
include an insertion parameters definition module coupled to the mixing module capable 
of determining the energy weighting term of the marking message and the attenuation 
factor from the intrinsic signal properties, the application domain constraints, and the 
properties of the transformation used. 

In particular, the insertion parameters definition module is capable of calculating two 
global insertion parameters in relation to the insertion distortion between the 
original signal and the marked signal in the transform space, the maximum allowable 
attack distortion D^. between the original signal and the resynchronised marked signal, 
in the transform space, and the signal to noise ratio between the energy of the marking 
message and the attack noise Eb/No. 

The two global insertion parameters are calculated by searching for the parameters X and 
X which maximise the relationship: 

Eb/No + X Dxy-- X Dxy 

The insertion parameters definition module is capable of calculating the energy 
weighting term of the marking message and the attenuation factor based on the two 
25 global insertion parameters thus determined. 

Other characteristics and advantages of the invention will become apparent by reading 
the following description and by reference to the figures in the attached diagrams in 
which: 

- Fig. 1 illustrates the configuration of a system for the transmission of marked 
30 multimedia signals for implementation of the invention, 

- Fig. 2 is a general arrangement of the insertion device in Fig. 1 , 



20 



WO 2004/043072 



6 



PCT/FR2003/003208 



- ¥ig. 3 is a general arrangement of the extraction device in Fig. 1 5 

- Fig. 4 is a block diagram of the insertion module in Fig. 2, 

- Fig. 5 is a block diagram of the mixing module in Fig. 4, 

- Fig. 6 is a graphical representation enabling the robustness of a signal to be assessed, 
5 following the addition of noise of given energy, 

- Fig. 7 is a block diagram of an embodiment of the extraction module in Fig. 3, and 

- Fig. 8 depicts the mechanism used in one embodiment. 

Appendix I lists the various notations used in the description. 

Appendix II lists the mathematical formulae used in the description. 

10 The figures and the attachments to the description essentially include elements that are 
certain in character. They can therefore serve not only to aid understanding of the 
description, but will also contribute to the definition of the invention, as applicable. 

The device for marking and restoring multimedia signals for implementation of the 
invention, depicted diagrammatically in Fig. 1, comprises a marker message insertion 
15 device 1 and a marker message extraction device 2. 

The message insertion device 1 generates a marking for a multimedia signal S to be 
transmitted through an application domain 3, based on the content of a marker message 
M. The marking technique used is an additive technique implementing a spread 
spectrum modulation process. It is similar to the COFDM modulation technique 
20 commonly used in digital communications. The components bj which define the marker 
message M are modulated by carriers defined by public and private keys, and applied to 
the input of the insertion device. The signal thus modulated is added to the original 
signal S. On extraction, a demodulation process is applied to restore the embedded 
components bj of the marker message. 

25 According to an advantageous characteristic of the invention, to impart sufficient 
robustness and to ensure that the embedded signal is not visible, the added signal is 
modulated in amplitude as a function of the energy of the mark added to each signal 
coefficient in the transform domain. Following this addition, a further amplitude 
modulation is applied to each marked coefficient. This second modulation corresponds 

30 to a Wiener filter intended to limit the noise thus added to the host signal. 
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Conventionally, the components bj correspond to the bits defining the message to be 
embedded after the possible application of correction codes. In the scheme presented 
here, a channel coding technique with side information is used. The components bj of 
this marking model are floating value data in this case. 

The marking process described below takes such a marking model into account and 
optimises it so as to resist attacks of the noise addition, filtering and partial 
desynchronisation type, modelling quite well the various processes to which a signal 
may be subjected. 

The insertion device depicted in Fig. 2 includes an insertion module 4 coupled upstream 
to a transformation module 5 and downstream to an inverse transformation module 6. In 
this configuration, the original signal S, defined in a first space, is applied to the 
transformation module 5 to be transformed into a number n of coefficients xi, defined in 
a second space. Any transformation process can be used, not excluding identity 
transformation which involves working directly on the original signal. Different 
transformations can be used, such as Fourier transformation, discrete cosine 
transformation, or wavelet transformation for example. 

After transformation of the original signal S, the message M to be embedded is applied 
in the insertion module 4 to the different coefficients xi of the transformed signal to 
form marked coefficients yi. The marked coefficients yi are then applied to the inverse 
transformation module 6 to undergo inverse transformation relative to that applied 
before marking, thereby restoring a marked signal close to the original signal. This 
marked signal is then transmitted to an extraction device, as depicted in Fig. 3. 

In Fig. 3, the extraction device 2, which is shown enclosed within a dotted line, includes 
a transformation module 7 coupled upstream to a ^synchronisation module 8 and 
downstream to an extraction module 9. The marked signal received is first 
resynchronised by the resynchronisation module 8, then transformed by the 
transformation module 7 into a series of coefficients yi 1 using a transformation identical 
to that applied in the insertion phase. The coefficients yi' are then applied to the 
extraction device 9 to extract the marking signal M. Any resynchronisation process can 
be used (exhaustive search related to the insertion of a pilot signal or to an intrinsic 
property of the mark), or it may be implicit by virtue of insertion into a domain invariant 
to desynchronisations (for example, amplitudes in a Fourier domain or Fourier-Mellin 
transformation). 
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In the following description, the notations listed in Appendix I are used. 

An embodiment of the insertion module 4 is depicted in Fig. 4, enclosed within a dotted 
line. This module includes a mixing module 10, a signal analysis module 1 1, an intrinsic 
properties analysis module 12, and a global insertion parameters definition module 13. 

The insertion of a message M into a signal with coefficients xi begins in module 1 1 by 
an analysis which serves to define the signal-related properties, i.e. the perceptual 
weighting in the distortion metric (pi, defined for each coefficient xi of the transformed 
original signal as a function of the variance value o xi 2 of the corresponding coefficient. 
The perceptual weighting (pi of each coefficient xi of the signal is a function of the type 
of signal processed, the transformation used and the values of the observed signal. 

Any method can be used to estimate the variances a X i 2 of the signal (Appendix 1-1). It is 
possible, for example, to use a weighted quadratic mean in a vicinity (or sliding 
quadratic mean), according to relationship (2) in Appendix II to the description. In this 
relationship, vi denotes a vicinity of the coefficient in question. 

The naive value (pi = 1 corresponds to the conventional mean quadratic error. An 
example of a model more adapted to images taking account of the masking phenomenon 
can be defined by relationship (3) presented in Appendix II to the description. In this 
relationship, a b i 2 corresponds to a visibility threshold for the i-th coefficient, and Vi 
corresponds to a local masking force factor defined by a sliding mean on the vicinity vi 
of the coefficient considered, according to relationship (4) in Appendix II. p is a 
parameter in the order of 0.5 to 1 (typically the values 0.5, 0.6 and 0.7 are most 
commonly used). 

Based on the application constraints and the properties of the transformation used, 
application parameters ai, bi and ci are determined by the intrinsic properties analysis 
module 12, for each coefficient xi. The parameter ai represents the degree of 
interference with the original signal, the parameter bi the degree of auto-interference of 
the embedded signal, and the parameter ci is the site attenuation parameter. 

Application parameters ai, bi and ci are used to take account of a desynchronisation 
phenomenon at each site, i.e. on each carrier frequency of the transform space. For 
example, for a desynchronisation Ai at the i-th site, representing the location precision of 
the coefficient, the values defined by relationship (5) in Appendix II to the description 
will typically be used. 
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• Based on the parameters <pi, a xi 2 , ai, bi and ci supplied by modules 1 1 and 12, module 13 
estimates the global insertion parameters X and %. Based on these global insertion 
parameters, module 13 then determines the insertion parameters yi and a™ defining the 
intrinsic properties of the marking signal. The first insertion parameter yi represents the 
5 attenuation factor of the site considered, and the second insertion parameter o\vi 
■ represents the marking energy weighting term. 

Once the various parameters have been established, insertion of the message M into the 
transformed signal {xi} is performed by the mixing module 10 based on the application 
parameters ai, bi and ci, calculated by module 12, the perceptual weighting {(pi) and the 
10 variance {a xi 2 } calculated by the signal analysis module 1 1, and the insertion parameters 
Owi and yi estimated by module 13. 

The mixing module includes a demodulator 15 which estimates the response rx of the 
transformed original signal to a demodulation of a first set of carriers {Gj}. This 
demodulation takes into account the perceptual weighting values (pi and the attenuation 
15 factor values yi. 

The mixing module 10 also includes a carrier generator 16 which generates the first set 
of m carriers {Gj} based on public or private keys. Each component rxj of the response 
of the transformed original signal is determined from the relationship £ ie [i,„]<pi(yi.xi).Gjj, 
where Gij denotes the i-th coefficient of the j-th carrier supplied by the carrier generator 
20 16. 

The mixing module 10, illustrated in Fig. 5, also includes a message formatting module 
14 capable of providing m components bj defining the message to be embedded, based 
on the responses rxj supplied by the demodulator 15 and a set of code words U applied 
to the formatting device 14 at the same time as the marking message M. 

25 The values of the n coefficients {yi} of the signal after marking are then calculated from 
these components bj, via a modulator 18, an adder 20 and a scaling module 17, in 
accordance with relationship (6) in Appendix II to the description. 
More precisely, for each of the bits of the public or private keys, the carrier generating 
device 16 supplies the carriers Gij to the modulator 18 to modulate the components bj. 

30 The modulator 18 performs a modulation of the components bj of the marking 
information by the carriers Gy to provide n coefficients relating to the marking 
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information. The i-th coefficient relating to the marking information is given by the 
relationship 2j € [i im ]hjGij- 

The modulator 18 can also perform an amplitude modulation of these coefficients 
relating to the marking information, by the term k 2i = o W i / 2je[i,m]Gij , relating to the 
energy weighting term of the marking message a wi and to the carriers Gy. 
The modulator 18 then provides to the adder circuit 20 a number n of coefficients 
relating to the marking information of the form: 
x'i = o wi / 2 jel , (m ]Gij 2 * 2j e [i,m]bjGij. 

The adder circuit 20 adds these coefficients x'i to the coefficients xi of the transformed 
original signal. This result is then scaled by the scaling module 17 based on the term k u 
= axi 2 / Co* 2 + owi 2 ), expressed in relation to the values of the variance o xi 2 of the signal 
in the transform space for the various coefficients xi and of the energy weighting term 
Owi of the added mark. This term corresponds to a Wiener filter. 

The scaling module 17 therefore provides the signal marked with coefficients yi in the 
1 5 transform space, as indicated by relationship (6) in Appendix II. 

The formatting module 14 of the mixer 10 will now be described in more detail. The 
formatting module 14 receives a message M to be embedded, which is defined on the 
basis of a set of code words U. This set is of size 2 C+R and is divided into 2 C subsets U M . 
Each of these subsets includes 2 R code words and are associated with each of the 2 C 
possible messages. The various code words are defined in an m-ary space and are such 
that: 1/m. Ej (U 2 kj ) = 1 for j e [l,m]. 

Any method of generating these code words and grouping these code words into subsets 
U M can be used. These notably include code words generated by a system of correction 
codes (for example, the first C bits are useful bits which identify the message, while the 
last R bits are host signal adaptation bits which identify the code word used for the 
message M). 

The formatting module 14 also receives the response rx of the transformed original 
signal, supplied by the demodulator 15. To determine the components rxj of this 
response, the demodulator 15 first provides an estimate of these according to the 
relationship S M i,n]<pi (Yi xi).Gij, as indicated above. Then it renormalises this estimate 
into rxj in an appropriate manner such that the insertion of rxj, using the technique 
proposed previously by the relationship (6), compensates the response of the host signal 
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at the point of attack in question defined by the attack parameters, whether this attack 
takes the form of added noise and filtering or partial desynchronisation. 
The formatting module 14 then looks for a code word U k , among the code words 
associated with the message M to be inserted, by minimising the square deviation 
criterion defined by relationship (7) in Appendix II to the description, based on the 
response rx to the transformed original signal. This code word represents a vector U k 
having m components U k j. 

Based on this code word U k and the response rx supplied by the demodulator 15, the 
formatting module 14 defines a vector V of dimension m having components defined 
by relationship (8) in Appendix II, where the notation <A|B> = SAjBj represents the 
scalar product between two vectors A and B. 

Based on this vector V, the formatting module 14 defines a vector V of components Vj 
according to relationship (9) in Appendix II, such that the vector V is proportional to the 
vector V and <V|V> = 0 or <V|V> = m applies depending on whether or not V is null. 
15 In particular, this vector V has the property of being orthogonal to the vector U k . 

The formatting module 14 then looks for the value of a parameter 9 maximising the 
relationship (10) formulated in Appendix II, based on parameters uO, v0 and K 
determined in relation to the response to the transformed original signal rx, the vector 
U k and the vector V. These parameters uO, vO and K are defined by the relationships 
20 (11) also included in Appendix II. 

Finally the formatting module 14 calculates the values of the components bj from the 
parameter G thus determined, and of the components U kj and Vj of the vectors U k and V, 
according to relationship (12) in Appendix DL 

The purpose of calculating values of the components bj is to define the signal to be 
added such that the response of the demodulator used in the extraction phase is 
consistent With that of the code word U k and as robust as possible. The robustness is 
defined by equation (10). This robustness corresponds to an energy level of the noise 
that can be added without leaving the cone associated with the code word U k in Fig. 6. 
Referring to Fig. 6, the vectors U k , represented by the vector u, and the vector V, 
represented by the vector y, form a normalised orthogonal base defining the hyperplane 
containing the response vector rx and the code vector U k . In this hyperplane, the 
displacement (cos 0, sin 0) defines the signal that can be added. To maximise equation 
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(10), it is then necessary to look for the vector of components bj maximising the 
robustness. Applied to each component of the modulation (i.e. values bj), this is then 
expressed by equation (12). 

Fig. 6 presents a geometric interpretation of this definition. The cone represented by the 
cross-hatched area represents the set of values leading to correct decoding of the code 
word. Sp represents the sent of points satisfying a power constraint P of the signal 
capable of being added (here P=l). The vector w corresponds to a vector of components 
bj and x corresponds to the vector rx. The hyperbolae Hn correspond to the responses of 
constant robustness (i.e. following the addition of a noise of given energy). 
A similar principle of signal definition has been proposed by Cox et al in an article 
entitled "Watermarking as communications with side information", Proc. IEEE, 
87(7): 1 127-1 141,1999, in the context of watermarking applied directly to the original 
signal, and in a detection context. Detection differs from extraction in that the presence 
of a known message U is sought. Also, the interpretation of the parameter K in equation 
15 (10) differs. In the paper by Cox et al, the parameter K is linked to a presence 
hypothesis test, whereas in extraction it ensures that the correct message is decoded 
(opening of the cone in Fig. 6 then depends on the dictionary used - see equation (11)). 
This technique aimed at limiting host signal interference corresponds to the technique of 
channel coding with side information. The general principle of this channel coding 
20 technique was initially proposed by Costa in an article entitled "Writing on dirty paper", 
IEEE Trans. Info. Thy, 29(3):439-441, May 1983. In the context of the invention, this 
technique is applied on the information obtained from demodulation of the carriers Gy. 
The global insertion parameters definition module 13 defining the intrinsic properties of 
the marking device is described in greater detail below. The definition module 13 first 
25 looks for the global parameters pair (KjC> to define the insertion parameters. 

The optimal pair sought can be defined by specifying two of the following three 
properties: 

- insertion distortion between the original signal x and the marked signal y, in the 
transform space, calculated according to a relationship similar to that given by 

30 relationship (1) in Appendix II; 

- maximum allowable attack distortion between the original signal x and the 
resynchronised marked signal y', in the transform space; 
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- the performance measure Eb/N 0 of the marking system. 

For example, for given distortions Dxy and Dxy, the system looks for the pair (X,x) 
yielding the highest value of the ratio Eb/N 0 , or for given Eb/N 0 and D xy , the system 
looks for the pair (X,x) yielding the highest value of Dxy, or for given Eb/N 0 and Dxy, the 
system looks for the pair (A,,x) yielding the smallest value of Dxy. 

The values of Dxy, Dxy' and Eb/No are expressed in relation to (X,x) according to 
relationships (13) and (14) formulated in Appendix II to the description. 

Having determined the global insertion parameters (A,,x)> module 13 then determines the 
insertion parameters yi and a W i; yi and o W i are auxiliary working variables, functions of 
X and & which define the insertion properties for a site i corresponding to the position of 
a coefficient xi in the spectrum of the transformed signal. For a site i, given the global 
parameters (X,x) and the local parameters ai, bi, ci and o xi , the pair (yi, a W i) is 
determined by executing the steps of the flow diagram illustrated in Fig. 8. 

At step 100, a wi is sought, in the interval [0, (pWX,a x 2 /ci] which maximises function (16) 
of Appendix II, with yi given by relationship (1 7) in Appendix II. 

2 2 

At step 102, for the point found, the device tests whether yi > 0 and yi < [o xl I (a X i + 
awi 2 )]: 

- If yi > 0 and yi < [c x i 2 / (o x 2 + <* w 2 )L the pair (yi, o wi ) is retained at step 104; 

- If not, at step 106, the pair (yi=l, a W i = 0) is used. That is no marking is performed at 
this site. 

In particular, in the case where ai = bi: 

- if X > % or if Oxi < [ci / (q>W aiV {%- X)], the pair (yi, o W i) given by relationships (18) in 
Appendix II is used; 

- if not, the pair (yi = 1 , a wi = 0) is retained. 

It will be noted that when ai = bi = 1 , o W i = (pi.a x 2 . V ^ / ci. 

The theoretical basis underpinning the developments described above is as follows. The 
different expressions used to define the insertion parameters correspond to expressions 
associated with a statistical model of the various signals and with a fairly generalised 
attack model. The coefficients xi are assumed to obey a Gaussian probability law with a 
mean of 0 and variance o x 2 5 and are assumed to be independent. The attacks considered 
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. are of the "scaling" type (factors yi) and the addition of Gaussian noise of variance a 5 2 . 
In this case: yi' = (yi/ywDyi +5i with y wi = a x 2 /(a w 2 + Ox 2 )- 

The scale factor also makes it possible to take proper account of the filtering techniques 
that can be applied. The novelty of the approach proposed here is to consider signals 
5 that are not identically distributed, the use of a perceptual metric, the inclusion of partial 
• desynchronisation, and the use of an insertion/extraction technique based on the use of a 
spread spectrum COFDM (Coded Orthogonal Frequency Division Multiplex) 
modulation applied to all of the coefficients. 

To define the parameters o wi 2 defining the insertion energy, it is also possible to 
10 consider a game between an attacker and a defender according to game theory. The 
attacker knowing the system used tries, in accordance with the known Kerckhoff s 
principle, to minimise the performance measure of the system Eb/N 0 under a maximum 
attack distortion constraint Dxy_max. 

On the other hand, the defender seeks to maximise this performance measure under a 
15 maximum insertion distortion constraint D^max. In the present case, Eb/N 0 represents 

the signal to noise ratio between the energy of the hidden message and the attack noise. 

This problem can then be solved using a Lagrangian formulation of the problem. The 

Lagrange factors X > 0 and % > 0 are then introduced, and the following subproblem 

dependent on (X,y£) is considered, namely to find a general solution to equation (15) 
20 defined in Appendix II to the description. 

The general solution is defined as the solution associated with the pair (k,y) resulting in 

a solution such that D xy ' = Dxy_max and D xy = Dxy_max. 

In the above description, the search is located on (X,x) in order to satisfy the distortion 
constraints. The expression to be maximised at step 100 corresponds to the term {Eb/No 
25 + X.Dxy - X-Dxy}. The last two terms being the Lagrangian terms associated respectively 
with the insertion and attack distortion. The terms associated with the constraint 
Dxy'_max and D xy _max have been removed as they are constant, and also for reasons of 
simplicity. 

It will be noted that the minimisation on the attack parameters (yi, o 8i ) has already been 
30 taken into account notably in the definition of the parameter yi in the first step. 

The extraction of an embedded message following attacks is accomplished in two 
phases in the extraction device 2. In a first phase, a linear demodulation is performed in 
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order to obtain observations bj withje[l,m]. The extracted message is then defined by 
seeking the code word close to the observations. 

In the extraction device 2, the marked signal yi is resynchronised by the 
resynchronisation module 8, then transformed by the transformation module 7 into a 
series of coefficients yi' using a transformation identical to that applied in the insertion 
phase. 

The extraction module illustrated in Fig. 7 includes a demodulator 21 coupled to an 
extracted message decoder. The demodulator 21 calculates a response of the signal {yi'} 
to a demodulation of a second set of carriers Gj supplied by a carrier generator 23, 
according to relationship (19) in Appendix II. This demodulation takes into account the 
perceptual weighting (pi calculated on the basis of an analysis performed by a module 24 
analysing the signal yi'. 

The demodulation is based on the extraction of an estimate of the inserted message 
bjby relationship (19) in Appendix n, at all of the marked sites. 

It will be noted that any estimator defining a response proportional to this estimator can 
also be considered. 

In an alternative embodiment, the second set of carriers is identical to the first set of 
carriers produced by the carrier generating module 1 6 of the insertion module. 

Decoding of the message takes place after its estimated formatting bj. It involves 
finding the code word U k closest to the estimated values bj by relationship (20) defined 
in Appendix II. 

The message associated with the code word U k then corresponds to the extracted 
message. An exhaustive search process can be used to perform the search for the closest 
code word, or any rapid search technique related to the definition of the code words 
used, by the use of a channel code decoding technique, for example. 
It is to be noted that the invention is not limited to the embodiments described above. 
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APPENDIX! 

T-1 Signals : 

- n: number of signal coefficients in the transform domain, 

- xi, i e [1, n]: values of signal coefficients in the transform space, 

5 ' - yi, i e [1, n]: values of signal coefficients in the transform space after marking. 

- yi', i e [1, n]; values of signal coefficients in the transform space after marking, attacks 
and resynchronisation. 

. o- xi 2 , i e[l, n]: variance values of the signal in the transform space for the different 
coefficients. 

l0 - Dxy = SDxyli: distortion between two signals x and y defined by relationship (1) 
formulated in Appendix II to the description. 

- (pi; perceptual weighting for the i-th coefficient in the distortion metric. These weights 
are defined in relation to the type of signal processed, the transformation used, and the 
signal values observed. 

15 - Dxy: insertion distortion. 

- Dxy*: attack distortion. 

- (ai, bi, ci): variables identifying the system properties relative to the different insertion 
coefficients (variables between 0 and 1). 

- ai: degree of interference with the original signal. 

20 - bi: degree of auto interference of the inserted signal. 

- ci: attenuation parameter of a site (for example associated with its sensitivity to 
desynchronising attacks); this term depends on the transform space used and on the 
order of magnitude of the estimated desynchronisation error following the 
resynchronisation performed on extraction, and on the allowable degradation. 



25 



T-2 Working variables : 

- global auxiliary working variables used to define the insertion parameters of 

each coefficient in the transform domain. 



WO 2004/043072 



17 



PCT7FR2003/003208 



- (yi, Gwi), ie {1, n}: auxiliary working variables defining the insertion parameters of 
each coefficient. 

- yi: attenuation factor. 

- c w i: energy weighting term of the mark added. 

5 

1-3 Modulation : 

- m: number of carriers used on insertion of the message. 

- bj with j e .{1, .., m): information defining the information to be added in order to 
insert the message. 

10 - Gy with (i, j) e {1,.., n} x {1,.., m}: information defining the known message insertion 
carriers at insertion and extraction. Any method of generating such carriers can be 
considered provided that they satisfy the condition Ei j [Gy] = 0 and Ei j [Gy ] = 1 . 

They can be generated for example via a secret key and a random number generator 
controlled by this secret key. 

: 15 / 

1-4 Code word dictionary 

- 2 C : number of existing messages capable of being embedded in the signal. 

- U: set of code words used. 2 C m-ary code words are defined, and grouped into 2 
subsets Um associated with the various existing messages M. 

20 - U k : code word used, of size m and defined by the values U k j with j e { 1 m} . 
1-5 Perceptual parameter 

(pi: perceptual weights of distortion of the signal coefficients. 
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APPRNDIXII 

T ist of fonnn la«» used in the description 
«pi 2 =»l VI 2 ) (3) 



io vx-o^^ixin/ivii 



(4) 



ai=bi = l -c^^vifeci = (5inc(Ai)) d (5), 

where sine (x) = sin (7ex)/jex and d, the dimension of the signal considered (1 for a ID audio 

signal, 2 for an image, etc). .'..■„ 

15 

. yi-k ll .(xi + k ai .Zj«(i^j<>j-G5)) 
with: ka-(pf/.CaJ + **) 

V»j =i3g - <rxIU^/i/"<J k lU,^ (8) 

Si<^HV»> = 0,Vj~0 (9) 
25 .Otherwise, Vj = Vj\ fM\V<V*\V>. 



{K.(u 0 + cos 8)* - (v 0 + sin 6) 2 } (10) 
with: 



30 



Uo = 1/m <rx fU^ (11) 
v 0 = l/m<rx|V> 
K = l/(2 2 - <c+R > /,n -l) 
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10 



bj - U kJ -cos 8 + Vj.sin 6 



.=1 



(12) 



(13) 



. -with: 

D^-^o/O/^^o^'' . . ° 4) 

(Eb/Na)|i = «pi 2 V*Xci Yi 

(15) 



max^ {min^^Eb/H, + -D^max) - X <P^ - D^max)}} 

(EMN.)|i + X D v |i - z DJi 06> 
15 yi- [o* 2 - ci I (<piV X)] I L(l-aW + .a*0««fl < 17) 

with: 

20 Aj-^A-xa-ai)) 
61 = 2 q>i V~A.'ci 
Di=cLa^(<pLV*X) 



25 



bj-Iie^^iyi'Gg) 

with Iw = set of marked sites. 



(19) 



0 
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(20) 



